APPLICANT: Pharmexa A/S

TITLE OF INVENTION: PURIFICATION OF HER-2 VARIANTS 
FILE REFERENCE: P1020DK00 
CURRENT APPLICATION NUMBER: US/10/560,961
CURRENT FILING DATE: 2005-12-14

NUMBER OF SEQ ID NOS: 2
SOFTWARE: PatentIn version 3.2
SEQ ID NO: 1 
LENGTH: 5661 
TYPE: DNA 

ORGANISM: Artificial sequence 
FEATURE:
OTHER INFORMATION: Recombinant expression plasmid derived from pMT
FEATURE:
NAME/KEY: polyA_signal
LOCATION: (263)..(268)
OTHER INFORMATION: SV40 late polyadenylation site
FEATURE:
NAME/KEY: misc_feature
LOCATION: (1579)..(2439)
OTHER INFORMATION: Ampicillin resistance gene, encoded by complementary strand
FEATURE:
NAME/KEY: promoter
LOCATION: (3050)..(3415)
OTHER INFORMATION: Metallothionein promoter
FEATURE:
NAME/KEY: RBS
LOCATION: (3493)..(3501)
OTHER INFORMATION: Kozak-like sequence
FEATURE:
NAME/KEY: CDS
LOCATION: (3502)..(5592)
OTHER INFORMATION: DNA encoding immunogenic, his-tagged variant of human HER-2
FEATURE:
NAME/KEY: sig_peptide
LOCATION: (3502)..(3555)
OTHER INFORMATION: BiP signal sequence
FEATURE:
NAME/KEY: misc_feature
LOCATION: (3556)..(3597)
OTHER INFORMATION: Histidine tag
FEATURE:
LOCATION: (3556)..(5589)
FEATURE:
NAME/KEY: misc_feature
LOCATION: (3598)..(3603)
OTHER INFORMATION: Dipeptidase Stop sequence
FEATURE:
NAME/KEY: misc_feature
LOCATION: (3604)..(5589)
OTHER INFORMATION: Gene coding for the hHER2MA5-5DUH protein
FEATURE:
NAME/KEY: misc_feature
LOCATION: (4357)..(4401)
OTHER INFORMATION: Tetanus toxoid P2 epitope
FEATURE:
NAME/KEY: misc_feature
LOCATION: (5500)..(5562)
OTHER INFORMATION: Tetanus toxoid P30 epitope
SEQUENCE: 1
ggccgctcga gtctagaggg cccttcgaag gtaagcctat ccctaaccct ctcctcggtc  60
tcgattctac gcgtaccggt catcatcacc atcaccattg agtttaaacc cgctgatcag  120
cctcgactgt gccttctaag atccagacat gataagatac attgatgagt ttggacaaac  180
cacaactaga atgcagtgaa aaaaatgctt tatttgtgaa atttgtgatg ctattgcttt  240
atttgtaacc attataagct gcaataaaca agttaacaac aacaattgca ttcattttat  300
gtttcaggtt cagggggagg tgtgggaggt tttttaaagc aagtaaaacc tctacaaatg  360
tggtatggct gattatgatc agtcgacctg caggcatgca agcttggcgt aatcatggtc  420
atagctgttt cctgtgtgaa attgttatcc gctcacaatt ccacacaaca tacgagccgg  480
aagcataaag tgtaaagcct ggggtgccta atgagtgagc taactcacat taattgcgtt  540
gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc cagctgcatt aatgaatcgg  600
ccaacgcgcg gggagaggcg gtttgcgtat tgggcgctct tccgcttcct cgctcactga  660
ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca gctcactcaa aggcggtaat  720
acggttatcc acagaatcag gggataacgc aggaaagaac atgtgagcaa aaggccagca  780
aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc tccgcccccc  840
tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga caggactata  900
aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc  960
gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt ctcatagctc  1020
acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga  1080
accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg agtccaaccc  1140
ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta gcagagcgag  1200
gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct acactagaag  1260
gacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa gagttggtag  1320
ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca  1380
gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga  1440
cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat caaaaaggat  1500
cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa gtatatatga  1560
gtaaacttgg tctgacagtt accaatgctt aatcagtgag gcacctatct cagcgatctg  1620
tctatttcgt tcatccatag ttgcctgact ccccgtcgtg tagataacta cgatacggga  1680
gggcttacca tctggcccca gtgctgcaat gataccgcga gacccacgct caccggctcc  1740
agatttatca gcaataaacc agccagccgg aagggccgag cgcagaagtg gtcctgcaac  1800
tttatccgcc tccatccagt ctattaattg ttgccgggaa gctagagtaa gtagttcgcc  1860
agttaatagt ttgcgcaacg ttgttgccat tgctacaggc atcgtggtgt cacgctcgtc  1920
gtttggtatg gcttcattca gctccggttc ccaacgatca aggcgagtta catgatcccc  1980
catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca gaagtaagtt  2040
ggccgcagtg ttatcactca tggttatggc agcactgcat aattctctta ctgtcatgcc  2100
atccgtaaga tgcttttctg tgactggtga gtactcaacc aagtcattct gagaatagtg  2160
tatgcggcga ccgagttgct cttgcccggc gtcaatacgg gataataccg cgccacatag  2220
cagaacttta aaagtgctca tcattggaaa acgttcttcg gggcgaaaac tctcaaggat  2280
cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact gatcttcagc  2340
atcttttact ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa  2400
aaagggaata agggcgacac ggaaatgttg aatactcata ctcttccttt ttcaatatta  2460
ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa  2520
aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga  2580
aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgttc  2640
gcgcgtttcg gtgatgacgg tgaaaacctc tgacacatgc agctcccgga gacggtcaca  2700
gcttgtctgt aagcggatgc cgggagcaga caagcccgtc agggcgcgtc agcgggtgtt  2760
ggcgggtgtc ggggctggct taactatgcg gcatcagagc agattgtact gagagtgcac  2820
catatgcggt gtgaaatacc gcacagatgc gtaaggagaa aataccgcat caggcgccat  2880
tcgccattca ggctgcgcaa ctgttgggaa gggcgatcgg tgcgggcctc ttcgctatta  2940
cgccagctgg cgaaaggggg atgtgctgca aggcgattaa gttgggtaac gccagggttt  3000
tcccagtcac gacgttgtaa aacgacggcc agtgccagtg aattaattcg ttgcaggaca  3060
ggatgtggtg cccgatgtga ctagctcttt gctgcaggcc gtcctatcct ctggttccga  3120
taagagaccc agaactccgg ccccccaccg cccaccgcca cccccataca tatgtggtac  3180
gcaagtaaga gtgcctgcgc atgccccatg tgccccacca agagttttgc atcccataca  3240
agtccccaaa gtggagaacc gaaccaattc ttcgcgggca gaacaaaagc ttctgcacac  3300
gtctccactc gaatttggag ccggccggcg tgtgcaaaag aggtgaatcg aacgaaagac  3360
ccgtgtgtaa agccgcgttt ccaaaatgta taaaaccgag agcatctggc caatgtgcat  3420
cagttgtggt cagcagcaaa atcaagtgaa tcatctcagt gcaactaaag gggggatcta  3480
gatcggggta ccaaagtcac c atg aag ttg tgc atc ttg ctg gcc gtc gtg  3531
                        Met Lys Leu Cys Ile Leu Leu Ala Val Val
                                    -15                 -10
gcc ttc gtg ggc ctg tcg ctg ggc atg aag cac caa cac caa cat caa  3579
Ala Phe Val Gly Leu Ser Leu Gly Met Lys His Gln His Gln His Gln
            -5              -1  1               5
cat caa cat caa cat caa gcc ccc tcc acc caa gtg tgt acc ggc aca  3627
His Gln His Gln His Gln Ala Pro Ser Thr Gln Val Cys Thr Gly Thr
    10                  15                  20
gac atg aag ctg cgg ctc cct gcc agt ccc gag acc cac ctg gac atg  3675
Asp Met Lys Leu Arg Leu Pro Ala Ser Pro Glu Thr His Leu Asp Met
25                  30                  35                  40
ctc cgc cac ctc tac cag ggc tgc cag gtg gtg cag gga aac ctg gaa  3723
Leu Arg His Leu Tyr Gln Gly Cys Gln Val Val Gln Gly Asn Leu Glu
                45                  50                  55
ctc acc tac ctg ccc acc aat gcc agc tta agt ttc ctg cag gat atc  3771
Leu Thr Tyr Leu Pro Thr Asn Ala Ser Leu Ser Phe Leu Gln Asp Ile
            60                  65                  70
cag gag gtg cag ggc tac gtg ctc atc gct cac aac caa gtg agg cag  3819
Gln Glu Val Gln Gly Tyr Val Leu Ile Ala His Asn Gln Val Arg Gln
        75                  80                  85
gtc cca ctg cag agg ctg cgg att gtg cga ggc acc cag ctc ttt gag  3867
tat
Ile
Gly Val
Ile Gln Arg
cag
cag gac acg att
atc ttc
Trp
Ile Phe
cag
gct
ctg ata gac acc aac cgc tct  4107
His Lys Asn Asn Gln Leu Ala
Thr
Ile
Thr Asn Arg Ser
cgg gcc tgc cac ccc tgt tct ccg atg     aag ggc tcc cgc tgc tgg  4155
Arg Ala Cys His Pro Cys Ser Pro Met     Lys Gly Ser Arg Cys Trp
                                                            200
gga gag agt tct gag gat tgt cag agc ctg acg cgc act gtc tgt gcc  4203
Gly Glu Ser Ser Glu Asp Cys Gln Ser Leu Thr Arg Thr Val Cys Ala
                205                 210                 215
ggt ggc tgt gcc cgc tgc aag ggg cca ctg ccc act gac tgc tgc cat  4251
Gly Gly Cys Ala Arg Cys Lys Gly Pro Leu Pro Thr Asp Cys Cys His
            220                 225                 230
gag cag tgt gct gcc ggc tgc acg ggc ccc aag cac tct gac tgc ctg  4299
Glu Gln Cys Ala Ala Gly Cys Thr Gly Pro Lys His Ser Asp Cys Leu
        235                 240                 245
gcc tgc ctc cac ttc aac cac agt ggc atc tgt gag ctg cac tgc cca  4347
Ala Cys Leu His Phe Asn His Ser Gly Ile Cys Glu Leu His Cys Pro
    250                 255                 260
gcc ctg gtc cag tac atc aaa gct aac tcc aaa ttc atc ggt atc acc  4395
Ala Leu Val Gln Tyr Ile Lys Ala Asn Ser Lys Phe Ile Gly Ile Thr
265                 270                 275                 280
gag ctg cgg tat aca ttc ggc gcc agc tgt gtg act gcc tgt ccc tac  4443
Glu Leu Arg Tyr Thr Phe Gly Ala Ser Cys Val Thr Ala Cys Pro Tyr
                285                 290                 295
aac tac ctt tct acg gac gtg gga tcc tgc         gtc tgc ccc ctg  4491
Asn Tyr Leu Ser Thr Asp Val Gly Ser Cys Thr Leu Val Cys Pro Leu
            300                 305                 310
cac aac caa gag gtg aca gca gag gat gga aca cag cgg tgt gag aag  4539
His Asn Gln Glu Val Thr Ala Glu Asp Gly Thr Gln Arg Cys Glu Lys
        315                 320                 325
tgc agc aag ccc tgt gcc cga gtg tgc tat ggt ctg ggc atg gag cac  4587
Cys Ser Lys Pro Cys Ala Arg Val Cys Tyr Gly Leu Gly Met Glu His
    330                 335                 340
ttg cga gag gtg agg gca gtt acc agt gcc aat atc cag gag ttt gct  4635
Leu Arg Glu Val Arg Ala Val Thr Ser Ala Asn Ile Gln Glu Phe Ala
345                 350                 355                 360
ggc tgc aag aag atc ttt ggg agc ctg gca ttt ctg ccg gag agc ttt  4683
Gly Cys Lys Lys Ile Phe Gly Ser Leu Ala Phe Leu Pro Glu Ser Phe
                365                 370                 375
gat ggg gac cca gcc tcc aac act gcc ccg ctc cag cca gag cag ctc  4731
Asp Gly Asp Pro Ala Ser Asn Thr Ala Pro Leu Gln Pro Glu Gln Leu
            380                 385                 390
caa gtg ttt gag act ctg gaa gag atc aca ggt tac cta tac atc tca  4779
Gln Val Phe Glu Thr Leu Glu Glu Ile Thr Gly Tyr Leu Tyr Ile Ser
        395                 400                 405
gca tgg ccg gac agc ctg cct gac ctc agc gtc ttc cag aac ctg caa  4827
Ala Trp Pro Asp Ser Leu Pro Asp Leu Ser Val Phe Gln Asn Leu Gln
    410                 415                 420
gta atc cgg gga cga att ctg cac aat ggc gcc tac tcg ctg acc ctg  4875
Val Ile Arg Gly Arg Ile Leu His Asn Gly Ala Tyr Ser Leu Thr Leu
425                 430                 435                 440
caa ggg ctg ggc atc agc tgg ctg ggg ctg cgc tca ctg agg gaa ctg  4923
Gln Gly Leu Gly Ile Ser Trp Leu Gly Leu Arg Ser Leu Arg Glu Leu
                445                 450                 455
ggc agt gga ctg gcc ctc atc cac cat aac acc cac ctc tgc ttc gtg  4971
Gly Ser Gly Leu Ala Leu Ile His His Asn Thr His Leu Cys Phe Val
            460                 465                 470
cac acg gtg ccc tgg gac cag ctc ttt cgg aac ccg cac caa gct ctg  5019
His Thr Val Pro Trp Asp Gln Leu Phe Arg Asn Pro His Gln Ala Leu
        475                 480                 485
ctc cac act gcc aac cgg cca gag gac gag tgt gtg ggc gag ggc ctg  5067
Leu His Thr Ala Asn Arg Pro Glu Asp Glu Cys Val Gly Glu Gly Leu
    490                 495                 500
gcc tgc cac cag ctg tgc gcc cga ggg cac tgc tgg ggt cca ggg